A Data-driven Approach to Pronominal Anaphora Resolution for German
Abstract
This paper reports on a hybrid architecture for computational anaphora resolution (CAR) for German that combines a rule-based pre-filtering component with a memory-based resolution module built on the Tilburg Memory Based Learner (TiMBL). The data source is the TüBa-D/Z treebank of German newspaper text (Telljohann et al. 04), which is annotated with anaphoric relations. The CAR experiments performed on these treebank data corroborate the importance of modelling aspects of discourse structure for robust, data-driven anaphora resolution. The best result, an F-measure of 0.734, outperforms the results reported by Schiehlen (04), the only other study of German CAR that is based on newspaper treebank data.
Related resources
Pronominal Anaphora Resolution in the KANTOO Multilingual Machine Translation System
We present an approach to pronominal anaphora resolution using KANT Controlled Language and the KANTOO multilingual MT system. Our algorithm is based on a robust, syntax-based approach that applies a set of restrictions and preferences to select the correct antecedent. We report a success rate of 93.3% on a training corpus with 286 anaphors, and 88.8% on held-out data with 144 anaphors. Our app...
Pronominal Anaphora Resolution Using a Shallow Meaning Representation of Sentences
This paper describes a knowledge-poor anaphora resolution approach based on shallow meaning representation of sentences. Within our representation, we define a new local domain which provides a powerful cue for resolving pronominal anaphora. Other information used included syntactic information, syntactic parallelism and salience weights. We collected 111 singular 3 person pronouns from open do...
Pronominal and Sortal Anaphora Resolution for Biomedical Literature
Anaphora resolution is one of essential tasks in message understanding. In this paper resolution for pronominal and sortal anaphora, which are common in biomedical texts, is addressed. The resolution was achieved by employing UMLS ontology and SA/AO (subject-action/action-object) patterns mined from biomedical corpus. On the other hand, sortal anaphora for unknown words was tackled by using the...
Modelling pronominal anaphora in statistical machine translation
Current Statistical Machine Translation (SMT) systems translate texts sentence by sentence without considering any cross-sentential context. Assuming independence between sentences makes it difficult to take certain translation decisions when the necessary information cannot be determined locally. We argue for the necessity to include cross-sentence dependencies in SMT. As a case in point, we st...
A Hybrid Approach to Pronominal Anaphora Resolution in Arabic
Corresponding Author: Abdullatif Abolohom, Department of Computer Science, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor, Malaysia. Abstract: One of the challenges in natural language processing is to determine which pronouns to be referred to their intended referents in the discourse. Performing anaphora resolution ...
Publication date: 2005